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HUMAN SPASMOLYTIC POLYPEPTIDE IN GLYCOSYLATED FORM 
FIELD OF INVENTION 

i 

The present invention relates to human spasmolytic polypeptide 
in glycosylated form, variants of human and porcine spasmolytic 
5 polypeptides and a method of producing spasmolytic polypeptides 
in glycosylated form, 

BACKGROUND OF THE INVENTION 

Human spasmolytic polypeptide (HSP) belongs to a family of 
peptides containing one or more characteristic trefoil domains 

10 [1]. The trefoil domain is made up t of a sequence of 38 or 39 
amino acid residues in which 6 cystein residues are linked in 
the configuration 1-5, 2-4 and 3-6 thus forming a 
characteristic trefoil structure [1]. The trefoil family of 
peptides consists of rat intestinal trefoil factor, ITF [2], 

15 human breast cancer associated peptide, pS2 [3,4,5], porcine, 
human and murine spasmolytic polypeptide (PSP, HSP, MSP) 
[6,7,8] and frog spasmolysins (xPl, xP2 and xP4) [8,10,11] all 
containing 1, 2 or 4 trefoil domains (Fig. 1). 

The physiological function of the trefoil peptides is poorly 
20 understood, and so far only PSP has been studied in any detail. 
In the porcine pancreas, PSP is found in the acinar cells and 
to be secreted in large amounts (50-100 mg/ml) into the 
pancreatic juice upon stimulation with pancreozymin or secretin 
[12,13,14]. PSP is resistant to digestion by intestinal 
25 proteases in the gastrointestinal tract [12], and specific 
binding of PSP to rat intestinal mucosa cells and membrane 
preparations from these cells has been demonstrated [15,16]. In 
the porcine gastrointestinal tract, specific receptor-like 
binding to Paneth cells in the duodenum has been found [17]. 
30 These results suggest a unique intraluminal function of the 
peptide. A pharmacological screening has indicated that PSP has 
spasmolytic and gastric acid secretion inhibitory effects [18], 
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and studies on mammalian cells have indicated a growth factor- 
like activity of PSP [19]. 



The DNA sequence and derived amino acid sequence of the human 
counterpart of porcine SP is shown in [8]. Unlike PSP, human SP 
5 (Fig. 2), has been found to be expressed in the stomach, but 
not in the pancreas to any greater extent [8]. An increased 
expression of HSP and pS2 has been reported to be associated 
with peptic ulcers and mucosal injury in inflammatory bowel 
disease [20,21] indicating a possible healing function of these 
10 peptides. 

Only very limited amounts of HSP can be prepared by extraction 
of human tissue. An object of study resulting in the present 
invention was therefore to prepare recombinant HSP in 
sufficient amounts for physiological and biochemical studies of 
15 the peptide. 

SUMMARY OF THE INVENTION 

It has surprisingly been found that when recombinant HSP is 
produced in certain host organisms, a proportion of it is 
produced in glycosylated form by posttranslational 
20 modifications. The glycosylated form of HSP has not, to 
applicant's best knowledge, been described previously. 

Accordingly, the present invention relates to human spasmolytic 
polypeptide (HSP) which has the amino acid sequence 

Glu Lys Pro Ser Pro Cys Gin Cys Ser Arg Leu Ser Pro His Asn Arg 
25 Thr Asn Cys Gly Phe Pro Gly He Thr Ser Asp Gin Cys Phe Asp Asn 
Gly Cys Cys Phe Asp Ser Ser Val Thr Gly Val Pro Trp Cys Phe His 
Pro Leu Pro Lys Gin Glu Ser Asp Gin Cys Val Met Glu Val Ser Asp 
Arg Arg Asn Cys Gly Tyr Pro Gly He Ser Pro Glu Glu Cys Ala Ser 
Arg Lys Cys Cys Phe Ser Asn Phe He Phe Glu Val Pro Trp Cys Phe 
30 Phe Pro Asn Ser Val Glu Asp Cys His Tyr (SEQ ID NO:l) 
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or a functionally equivalent homologue thereof, characterized 
by being in glycosylated form. 

In the; present context, the term "functionally equivalent" is 
intended to indicate that the homologous polypeptide has a 
5 biological activity (e.g. spasmolytic effect) corresponding to 
that of native HSP. The term "homologue" is intended 1 to 
indicate a polypeptide encoded by DNA which hybridizes to the 
same probe as the DNA coding for HSP under comditions of high 
or low stringency (e.g. as described in Sambrook et. al., 

10 Molecular Cloning: A Laboratory Manual , 2nd Ed., Cold Spring 
Harbor Laboratory Press, Cold Spring Harbor, NY, 1989). More 
specifically, the term is intended to refer to a DNA sequence 
which is at least 60% homologous to the sequence encoding HSP 
with the amino acid sequence shown above. The term is intended 

15 to include modifications of the DNA sequence such as nucleotide 
substitutions which do not give rise to another amino acid 
sequence of the polypeptide, but which correspond to the codon 
usage of the host organism into which the DNA construct is 
1 introduced or nucleotide substitutions which do give rise to a 

20 different amino acid sequence and therefore, possibly, a 
different protein structure which might give rise to a mutant 
polupeptide with different properties than the native enzyme. 
Other examples of possible modifications are insertion of one 
or more codons into the sequence, addition of one or more 

25 codons at either end of the sequence, or deletion of one or 
more codons at either end or within the sequence. The term 
"glycosylated" is intended to indicate that a carbohydrate 
moiety is present at one or more sites of the protein molecule. 

It is at present contemplated that glycosylation of HSP may 
30 give rise to differences in the biological activity of the 
protein, for instance with respect to stability towards 
proteolytic enzymes in the gastrointestinal tract, solubility 
at gastric and/or intestinal pH compared to non-glycosylated 
HSP, antigenicity, half-life, tertiary structure, and targeting 
35 to receptors on appropriate cells. 



BNSDOCID: <WO 9417102A1_I_> 



WO 94/17102 PCT/DK94/00037 



In another aspect, the present invention relates to a variant 
of a spasmolytic polypeptide (SP) which is a fragment of human 
spasmolytic polypeptide (HSP) or porcine spasmolytic 
polypeptide (PSP) comprising at least one trefoil domain. 

5 The variant SP may be provided in both glycosylated and non- 
glycosylated form. It is at present contemplated that such a 
variant may be advantageous to use instead of full-length SP 
because of a higher specific biological activity, increased 
solubility and stability, longer half-life, easier way of 
10 production, or the like. 

It is assumed that other spasmolytic polypeptides than HSP 
will, if provided with a glycosylation site, also be expressed 
in predominantly glycosylated form. In a further aspect, the 
present invention therefore relates to a method of preparing a 

15 spasmolytic polypeptide in at least 60% glycosylated form, 
wherein a host cell transformed with a DNA fragment encoding a 
spasmolytic polypeptide and capable of providing glycosylation 
of said spasmolytic polypeptide is cultured under conditions 
permitting production of said spasmolytic polypeptide and 

20 recovering the resulting spasmolytic polypeptide from the 
culture. 

DETAILED DESCRIPTION OF THE INVENTION 

It has been found that, at least when recombinant. HSP is 
produced in yeast, the proportion of it that is provided in 

25 glycosylated form is in N-glycosylated form. It has further 
been found that glycosylation takes place at Asnl5 of the 
seguence shown above. In preferred embodiments of glycosylated 
HSP, the glycosylated side chain contains at least one hexose 
unit. In particular, the glycosylated side chain may contain at 

30 least one mannose unit, preferably at least five mannose units, 
most preferably at least ten mannose units. In one preferred 
embodiment of glycosylated HSP of the invention, the 
glycosylated side chain contains 13-17 mannose units. In other 
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preferred embodiments, the glycosylated HSP is in addition 
glycosylated with at least one unit of N-acetyl glucosamine 
(GlcNAc) . In the currently preferred embodiment, the 
glycosylated HSP is glycosylated at Asnl5 with (GlcNAc) 2 (Man) 10 _ 

5 15 • 

It is further contemplated to produce homologues of HSP which 
are provided with one or more additional glycosylation sites. 
Thus, the present invention also relates to HSP homologues, 
wherein Lys2 is replaced by Asn, Gln7 is replaced by Asn, ArglO 

10 is replaced by Asn, Gly 20 is replaced by Thr or Ser, Gly23 is 
replaced by Asn, lie 24 is replaced by Asn, Phe 36 is replaced 
by Asn, Asp 37 is replaced by Asn, Ser39 is replaced by Asn, 
Gln53 is replaced by Asn, Glu61 is replaced by Asn, Asp64 is 
replaced by Asn, Arg66 is replaced by Thr or Ser, Gly69 is 

15 replaced by Thr or Ser, Gly72 is replaced by Asn, lie 89 is 
replaced by Thr or Ser, Pro98. is replaced by Asn or VallOl is 
replaced by Thr or Ser, or a combination of two or more of 
these substitutions. In a currently preferred embodiment of 
such an HSP homologue, Asp64 is replaced by Asn, and Arg66 is 

2 0 replaced by Thr or Ser. 

It is of course understood that HSP homologues of the invention 
may be glycosylated in the same manner at one or more of these 
sites as described above for glycosylation at Asnl5. 

It is assumed that the trefoil structure common among 
25 spasmolytic polypeptides is important for the function of HSP 

and PSP. The variant human or porcine SP comprising a fragment 

of the full-length polypeptide should therefore include at 

least three disulfide bonds to provide this structure. 

Consequently, the variant may comprise at least a sequence of 
30 amino acids from position 8 to 46 or from position 58 to 95, 

each of which sequences defines a trefoil domain of HSP and 

PSP. 
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provided in no„-g ly cos y lated for,. This may, for instonce * be 
accomplished by substituting Asnl5 by another amino acid e g 
Asp or Glu, or by substituting Thrl7 by another amino' acid 
5 except ser, e.g. It ls „ ore Ukely# ^ 

.ore additicna! glycosylate sites win be introduced into 
this domain, for instance by replacing Arglo by Asn, sly 20 by 
Thr or ser, Gly 2 3 by Asn. He 24 by Asn. Phe 36 by Asn, Asp 37 

10 th Z\ ^ ASn ' ^ 8 C0 " bI "«-" of two or more of 

10 these substitutions. 

On the other hand, it may be desirable to provide the trefoil 
domain fro, position 58 to 95 with a glyoosylation site lacking 
« this domain in native HSP and PSP. Thus, Glu6l may be 

15 IZ ITl by Asn ' ASP64 by Asn ' Arg66 by Thr or s -' «y« ^ 

15 Thr or ser, or Cl y72 is replaced by ^ Qr q J 
two or more of these substitutions, in . currently preferred 
embodiment of the variant, Asp64 is re pi aced by Asn ' L ^ 
is replaced by Thr or Ser. 

20 b! ll ^ C r rSe UnderSt ° 0d that Va ~*nts of the invention may 
20 be glycosylated in the same manner at one or more of these 
sites as described above for glyoosylation at Asnl5 in full- 
length HSP. un 

A DHA seguence encoding HSP may suitably be isolated fro. a 

human genomic DNA library by POP (polymerase chain reaction, 

25 cloning using primers based on the published cDHA seguence ,.j 

Alternatively, the DHA seguence may be prepared synthetic 1 y 

by established standard methods, e.g. the phcsphoamidite method 
desc rlbed by S . L . Beaucage and ^ ^ 

30 „atthes et a!., E„B0 JO n r nM 3, 1M4 , pp . 801 . 805 . kcc * 
the phcsphoamidite method, oligonucleotides are synthesized, 
e.g. m an automatic DNA synthesizer, purified, annealed 

n 9 ,r rv? loned , in sultabie veotors - The cdna -»■•»- sho„; 

in CS) may be used as the basis of oligonucleotide synthesis. 
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Alternatively, it is possible to use cDNA coding for HSP 
obtained by screening a human cDNA library with oligonucleotide 
probes in accordance with well-known procedures. 

Furthermore, the DNA sequence may be of mixed synthetic and 
5 genomic, mixed synthetic and cDNA or mixed genomic and cDNA 
brigin prepared by ligating fragments of genomic, synthetic' or 
cDNA origin (as appropriate) , the fragments corresponding to 
various parts of the entire DNA sequence, in accordance with 
standard techniques. 

10 The SP variant of the invention may be encoded by a fragment 
of the full-length DNA sequence, prepared by one of the methods 
indicated above, or by , suitably truncating the full-length 
sequence. 

The DNA sequence encoding HSP or an SP variant of the invention 
15 may then be inserted in a suitable expression vector. The 
recombinant expression vector may be any vector which may 
" conveniently be subjected to recombinant DNA procedures, and 
the choice of vector will often depend on the host cell into 
which it is to be introduced. Thus, the vector may be an 
20 autonomously replicating vector, i.e. a vector which exists as 
an extrachromosomal entity, the replication of which is 
independent of chromosomal replication, e.g. a plasmid. 
Alternatively, the vector may be one which, when introduced 
into a host cell, is integrated into the host cell genome and 
25 replicated together with the chromosome (s) into which it has 
been integrated. 

In the vector, the DNA sequence encoding HSP or an SP variant 
of the invention should be operably connected to a suitable 
promoter sequence. The promoter may be any DNA sequence which 
30 shows transcriptional activity in the host cell of choice and 
may be derived from genes encoding proteins either homologous 
or heterologous to the host cell. Examples of suitable 
promoters for directing the transcription of the DNA encoding 
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the inhibitor of the invention in cells „. ^ sv 

promoter (Subramani et ,1.. ttU^CellJioU x „„ " " 

.«). «,. MT-1 .-taiiotHionl—eneT^o^r P .:Lr t " 
a ScieBce a22. 19.3. pp. S0S-S14) or the adenovirus 2 major 

Lr 0 " 0 *"- SUUable Pr °" 0terS U " *» »-* cells 
J Biol P r° terS 9lyCOlVtlC «"™ < Hit ^" * a 

**' 1980 ' PP ' 12073 - 120 "-- ana KawasaM 

LtaMe romlrs^sft ^la T P ~" 
, n fllame ntous fungus host cells 

15 r ; n.rr? 0 r ^ promoter (McKni9ht ™ 

1, 1985, pp. 2093-2099) or the tpi A promoter. 

The DNA sequence encoding HSP or an sp - 
opera.lv connected to a suitable termini I " ^ 
9to« h hormone terminator (Paimiter et al . , ^ ci , or ^ 
fungal hosts) the TPH fA lbPr * n * v ■ (f ° r 

20 (McKnight et al ^ c £ » ' ^ ^ « ™ 

9 et al., £1^) promoters. The vector may further- 

equences (e.g. the ones encoding adenovirus VA RNAs) . 

25 The recombinant expression vector may further comprise a DNA 
sequence enabling the vector to replioate in the host eel! 
gue t, A „ _ ples o£ a seguen=e <uhen -host 

a mammaUan eel!, is the SV ,0 origin of re plic atio„. or when 

30 nes rjt' I! ^ ^ ^ 2 » 

^ er ' e.g. a gene the product o«f ttk^k 
complements a defect in the host cell, such as the 

drug, e.g. neomycin, hygromycin or 
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methotrexate, or the Schizosaccharomvces pombe TPI gene 
(described by P.R. Russell, Gene 40, 1985, pp. 125-130. 

The procedures used to ligate the DNA sequences coding for HSP 
or the 1 SP variant, the promoter and the terminator, 
5 respectively, and to insert them into suitable vectors 
containing the information necessary for replication, are well 
known to persons skilled in the art (cf., for instance, 
Sambrook et al., Molecular Cloning: A Laboratory Manual , Cold 
Spring Harbor, New York, 1989). 

10 The host cell into which the expression vector is introduced 
may be any cell which is capable of producing the inhibitor of 
the invention and is preferably a eukaryotic cell, such as a 
mammalian, yeast or fungal cell. 

The yeast organism used as the host cell may be any yeast 
15 organism which, ' on cultivation, produces large quantities of 
the inhibitor of the invention. Examples of suitable yeast 
organisms are strains of the yeast species Saccharomvces 
cerevisiae , Saccharomvces kluvveri . Schizosaccharomvces pombe 
or Saccharomvces uvarum. The transformation of yeast cells may 
20 for instance be effected by protoplast formation followed by 
transformation in a manner known per se. 

Examples of suitable mammalian cell lines are the COS (ATCC CRL 
1650), BHK (ATCC CRL 1632, ATCC CCL 10) or CHO (ATCC CCL 61) 
cell lines. Methods of transfecting mammalian cells and 

25 expressing DNA sequences introduced in the cells are described 
in e.g. Kaufman and Sharp, J. Mol . Biol. 159, 1982, pp. 601- 
621; Southern and Berg, J. Mol. AppI . Genet. 1, 1982, pp. 327- 
341; Loyter et al. , Proc. Natl. Acad. Sci. USA 79 , 1982, pp. 
422-426; Wigler et al., Cell 14/ 1978, p. 725; Corsaro and 

30 Pearson, Somatic Cell Genetics 7, 1981, p. 603, Graham and van 
der Eb, Virology 52, 1973, p. 456; and Neumann et -al., EMBO J. 
1, 1982, pp. 841-845. 
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Alternatively, fungal cells may be used as host cells. Examples 
of suitable fungal cells are cells of filamentous fungi e g 
ASperq i]1 "*- Sp P- or "^ospora spp., in particular strains of 
Aspergillus orvzae or Asperg illus mger. The use of A^pergillos 
5 spp. for the expression of proteins is described in e a ep 
238 023. ' * V * ' 



According to the present method, yeast cells are currently 
preferred for producing HSP and other SPs (such as those shown 
an Fig. i) , as they have surprisingly been found to produce SP 
10 in a high yield and in at least 60% glycosylated form. For 
instance, about two thirds of the HSP produced by yeast may be 
recovered in glycosylated form. 

The medium used to cultivate the cells may be any conventional 
medium suitable for growing mammalian cells or fungal 

15 (including yeast) cells, depending on the choice of host cell 
The spasmolytic polypeptide will be secreted by the host cells 
to the growth medium and may be recovered therefrom by 
conventional procedures including separating the cells from the 
medium by centrifugation or filtration, precipitating the 

20 protemaceous components of the supernatant or filtrate by 
means of a salt, e.g. ammonium sulfate, purification by a 
variety of chromatographic procedures, e.g. ion exchange 
chromatography or affinity chromatography, or the like. 

The present invention also relates to a pharmaceutical 
25 composition comprising HSP or a variant spasmolytic polypeptide 
of the invention together with a pharmaceutical^ acceptable 
carrier or excipient. in the composition of the invention, the 
variant may be formulated by any of the established methods of 
formulating pharmaceutical compositions, e.g. as described in 
30 yeninqt "" ,B Pharmaceutical Sciences , 1985. The composition may 
typically be in a form suited for oral or rectal administration 
and may, as such, be formulated as tablets or suppositories 
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HSP or an SP variant of the invention is contemplated to be 
useful for the prophylaxis or treatment of gastrointestinal 
disorders. More specifically, it , is contemplated for the 
treatment of gastric or peptic ulcers, inflammatory bowel 
5 disease/ Crohn's disease or injury to the intestinal tract 
caused by radiation therapy, bacterial or other infections, 
etc. 

BRIEF DESCRIPTION OF THE DRAWINGS 

The invention is further illustrated in the following example 
10 with reference to the appended drawings in which 

Fig. 1 shows the trefoil family of peptides. Intestinal trefoil 
factor (ITF) contains one trefoil domain [2], as does the 
breast cancer associated pS2 peptide [3,4]. The spasmolytic 
polypeptides from man, pig and mouse contain two trefoil 
15 domains [1,8]. Spasmolysins from Xenopus laevis contain one or 
four trefoil domains [10]. Recently, a member of the frog 
trefoil family containing two domains has been described [11]. 

Fig. 2 shows the proposed structure of human spasmolytic 
polypeptide, HSP. The primary amino acid sequence is taken from 
20 Tomasetto et al. [8], and the disulphide bonds are placed in 
homology to PSP [1]. 

Fig. 3 shows the nucleotide sequence (SEQ ID NO: 2) and 
corresponding amino acid sequence (SEQ ID NO: 3) of the 563 bp 
EcoRI - Xbal fragment encoding the leader - HSP fusion protein. 

25 The Kex 2 processing site is indicated by a vertical arrow. The 
leader and the PCR cloned parts of the HSP gene are shown in 
capital letters, while the synthetic parts are shown i small 
letters. The underlined sequences correspond to the PCR primers 
with horizontal arrows indicating the direction. Restriction 

30 sites relevant for the construction are shown. 
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Mg. . shows the s. cerevisi M p la s„id tor the expression and 
secretin o £ HSP. TPi-prom. and ^ _ > ° - 

tr,osephosphate isomerase transcription promoter ana ter^ina^ 
sequences respectively. POT is a selective marXer, the 
5 M^fllttSB^ triosephosphate isomerase gene Only 
restr.ctron sites relevant for the construction of the plas.id 
have been indicated. ^ iQ 

veLV h ° US / eVerSed " PhaSe " PLC °" 3 Vydac 2 " Tt>5 < of 
10 an r, br ° th - tU ° P " kS """Ponding to r-HSP 

10 and glycosylated r-HSP are indicated. The dashed line shows th. 

concentration of acetonitrile in the eluting solvent. 

Fig. 6 shows ion exchance chromatography on a Past Flow s 
co umn of concentrated yeast supernatant. The amount of r-HSP 
end glycosylated r-HSP were determined hy the use of the HP^ 

ZTT T n ^ ^ ^ tactions pooL^ 

dash d PU """ tiOT ° £ ^HSP and glycosylated r-HSP. The 

dashed 11B . shows the concentration of Kacl in th . Qlu J£ 
solvent. For details, see material and Methods. 9 

Fig. 7 shows the final purification of r-HSP 
,0 glycosylated r-HSP (8) on a preparative reversed-ph s ™ 

Vydac 2 1«TP10 22 column. The hars indicate the fractions pooled 
. for lyophilrzation. The dashed lines show the concentration of 

acetonitrile in the eluting solvent. For details, see Material 

and Methods. JJateriai 

25 Fig. 8 shows reversed-ohasp hpt r> ™ „ 

fied, glyoosylate/ ™ "and T^Z^ZZl 
a"" " e — — - acetonitrile' in the eluttg 

Fig. 9 shows mass spectra of purified r-HSP (A and B) and 
10 glycosylated r-HSP ,c and D) . Fig. A and FiJ. c "sh 

res! I T S SPeCtrU " ° f r " HSP a " d glycosylated r-HSP, 
respectively. Fig. B and Fig. D show the reconstructed mass 
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spectrum for r-HSP and glycosylated r-HSP on the basis of Fig. 
, A and Fig. C. 

EXAMPLE 

MATERIAL AND METHODS 
5 General methods 

Standard DNA techniques were used as previously described [29]. 
Synthetic oligonucleotides were prepared on an automatic DNA 
synthesizer (380B, Applied Biosystems) using commercially 
available reagents. DNA sequence determinations were performed 
10 by the dideoxy chain-termination technique [30]. Polymerase 
chain reactions (PCR) were performed on a DNA Thermal Cycler 
(Perkin Elmer Cetus) using a commercial kit (GeneAmp, Perkin 
Elmer Cetus) . 

PCR cloning of HSP 

15 The first trefoil domain of HSP was isolated by a PCR reaction 
in which 1 fig human genomic DNA (Clontech, Palo Alto, CA, USA) 
was used as a template. The reaction mixture contained 100 
pmole each of the forward primer 1 (GGCTGAGCCCCCATAACAG) (SEQ 
ID NO: 4) and reverse primer 2 (TGGAAACACCAGGGGAC) (SEQ ID NO: 5) 

20 (Fig. 3) and was carried out in a 100 /il volume. The cycle was 
: 94°C for 1 min, 50°C for 1 min, and 72°C for 1 min. After 30 
cycles a final cycle was performed in which the 72°C step was 
maintained for 10 min. The PCR product, a 115 bp fragment, was 
isolated by electrophoresis on a 2% agarose gel. 

25 The 115 bp PCR fragment was digested with Ddel and then ligated 
to a 31 bp duplex formed from the oligonucleotides 
(GAGAAACCCTCCCCCTGCCAGTGCTCCAGGC) (SEQ ID NO: 6) and 
(TCAGCCTGGAGCACTGGCAGGGGGAGGGTTTCTC) . The ligation product was 
amplified by PCR using forward primer 3 

30 (GCTGAGAGATTGGAGAAGAGAGAGAAACCCTCCCCCT) (SEQ ID NO: 7) and 
reverse primer 2. The 3 1 part of primer 3 is identical to the 
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N-terminal encoding part of the HSP gene and the 5- part of 
primer 3 is identical to the C-terminal encoding part of the 
hybrid leader gene (Fig. 3). In-frame fusion of the hybrid 
leader gene and the first trefoil domain from HSP was obtained 
5 by overlay extension PCR [31]. The product was digested with 
EcoRI and Avail and isolated as a 360 bp DNA fragment. 

The second trefoil domain of HSP was PCR-cloned from human 
genonuc DNA as described for the first domain by replacing 
primers l and 2 with forward primer 4 (TGCGTCATGGAGGTCTC) (SE q 
10 ID N0:8) and reverse primer 5 (AGCACCATGGCACTTCAAAG) (SEQ ID 
NO:9) (Fig. 3). Reverse primer 5 introduces a Ncol site as a 
salent mutation. The PCR product, a 115 bp fragment, was 
ablated and digested with Ddel and Ncol resulting in a 91 bp 
fragment. To this fragment were ligated two synthetic duplexes. 
15 The first, encoding the amino acid sequence between the two 
trefoil domains, consisted of the oligonucleotides 
(GTCCCCTGGTGTTTCCACCCCCTCCCAAAGCAAGAGTCGGATCAGTGCGTCATGGAGGTC) 
(SEQ ID NO: 10) and (TGAGACCTCCATGACGCACTGATCCGACTCTTGCT- 
TTGGGAGGGGGTGGAAACACCAGGG) (SEQ ID NOrll). The second, a 46 bp 
2 0 Ncol - Xbal fragment encoding the C-terminal part of „SP 
consisted of the oligonucleotides ( CATGGTGCTTCTTCCCGAACTCTGT- 
GGAAGACTGCCATTACTAAGT) (SEQ ID NO-12) and 

(CTAGACTTAGTAATGGCAGTCTTCCACAGAGTTCGGGAAGAAGCAC) (SEQ ID 

25wa 0 s 13) \Tr AVaI1 di9eSti ° n 3 bP AVa " " XbaI '"9-ent 

25 was isolated. 



A DNA construct encoding the hybrid loader fused in . frame to 
the entire HSP gene was obtained by ligation of the 360 bp 
Beora - Av.il fragment and the 105 bp Avail - xbal fragment 

3 0 IZl t0 kb EOORI " XtaI f »»-»t *«- v-tor 

30 pT31 9R [32) . This construct was then transfor-ed into E. coli 

strain m-172 ,r, *•) by selection for resistance 

a-piorllin. DNA sequencing of the resulting plas„id, KFN-1843 

showed that the correct construction had been obtained. 
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construction of the HSP secreting yeast strain 

Plasmid KFN-1843 described above was digested with EcoRI and 
Xbal. The resulting 558 bp fragment was isolated and ligated to 
the 9.3 «kb Ncol - Xbal fragment and the 1.6 kb Ncol - EcoRI 
5 fragment both from the yeast expression vector pMT-636. Plasmid 
pMT-636 is derived from the S. cerevisiae - E. coli shuttle 
vector CPOT [25,33] by deletion of the 0.4 kb Hpal - Nrul 
fragment from the Leu-2 gene. The ligation mixture was trans- 
formed into E. coli strain MT-172, and the HSP expression 
10 plasmid, KFN-1847, was isolated (Fig. 4). Plasmid pKFN-1847 was 
transformed into S. cerevisiae strain MT-663 by selection for 
growth on glucose as the sole carbon source. One transf ormant , 
KFN-1852, was selected for fermentation. 

Fermentation 

15 The transf ormant described above was cultivated at 30 °C for 3 
days in yeast pep.tone dextrose ( YPD) medium [40] supplied with 
additional yeast extract (60 g/1) . An OD 650 nm value of 52 was 
reached at the end of the fermentation. 

Purification of r-HSP 

2 0 The concentration of r-HSP in the yeast fermentation broth and 
fractions obtained during the purification was measured by 
analytical HPLC Aliquots (usually 50-200 ^1) were injected 
onto a Vydac 214TP54 reverse-phase C4 HPLC column (0.46 x 25 
cm) equilibrated at 30° C at a flow rate of 1.5 ml/min with 0.1% 

25 (v/v) TFA in 5% (v/v) acetonitrile. The concentration of 
acetonitrile in the eluting solvent was raised to 65% (v/v) 
over 3 0 min. Absorbance was measured at 280 nm. The peaks 
eluting at 15.6 min. and 16.1 min. (Fig. 5) was found by mass 
spectrometry analysis to represent glycosylated r-HSP and 

30 unglycosylated r-HSP, respectively. The peptides were 
quantified using a calibrated PSP sample as standard as both 
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peptides contain two Trp and two Tyr out of 106 amino acid 
residues . 

From a 10 litre fermentor, 8 litres of fermentation broth was 
isolated by centrif ugation at 3,000 rpm for 10 min. The 
5 supernatant was concentrated to 0.9 litre using an Amicon 
ultrafiltration unit (RA 2000) equipped with an Amicon spiral 
ultrafiltration cartridge type S1Y3, MW cutoff 3,000 (Product 
No. 540620). The pH was adjusted to 1.7 and the conductivity in 
the resulting concentrated sample was measured to 4.7 mS. 

10 The sample was pumped onto a Fast Flow S-Sepharose (Pharmacia) 
column (5 x 11 cm) with a flow rate of 40 ml/h. 'Previous to the 
application, the column was equilibrated in 50 mM formic acid 
buffer, pH = 3.7. After application' of the sample, the column 
was washed with 500 ml of 50 mM formic acid buffer, pH = 3.7. 

15 The. peptides were eluted from the column by a linear gradient 
between 1.5 litres of 50 mM formic acid buffer, pH = 3.7 and 
1.5 litres of 50 mM formic acid buffer, pH = 3.7 containing 0.6 
M NaCl. Fractions of 10 ml was collected at a flow rate of 40 
ml/h and the absorbance was measured at 280 nm. Fractions were 

20 assayed for the content of r-HSP and glycosylated r-HSP in the 
HPLC-system previously described. The elution profile is shown 
in Fig. 6. Fractions corresponding to r-HSP (fract. Nos. 107- 
128) and glycosylated r-HSP (fract. Nos. 78-95), respectively, 
were pooled. 

25 Glycosylated r-HSP and r-HSP were further purified by 
preparative HPLC chromatography. Pooled fractions (approx. 200 
ml) were pumped onto a Vydac 214TP1022 C4 column (2.2 x 25 cm) 
equilibrated in 0.1% (v/v) TFA. The column was washed with 100 
ml of 0.1% (v/v) TFA in 10% (v/v) MeCN . The peptides were 

30 eluted at 25'C and at a flow rate of 5 ml/min with a linear 
gradient (650 ml) formed from MeCN/H 2 0/TFA (10.0:89.9:0.1 v/v/v) 
and MeCN/H 2 0/TFA (60.0:39.9:0.1 v/v/v). UV-absorption was 
monitored at 280 nm, and fractions corresponding to 10 ml were 
collected and analysed for the content of r-HSP or glycosylated 
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r-HSP. Fig! 7 shows the preparative HPLC purification of r-HSP 
(Fig. 7A) and glycosylated r-HSP (Fig. 7B) . Fractions cor- 
responding to the bars were pooled, and the volume reduced to 
30% by vacuum centrif ugation. From the two resulting pools, r- 
5 HSP and glycosylated r-HSP were isolated by lyophilization. 

Characterization of r-HSP and glycosylated r-HSP 

Amino acid composition analysis were carried out by hydrolysis 
of 50 /ig peptide with 6M HC1 for 24 h at 110 C C as previously 
described [6]; no correction for loss during hydrolysis was 
10 carried out. Amino acid sequence analysis was determined by 
automated Edman degradation using an Applied Biosystems Model 
470A gas-phase sequencer [22]. , Carbohydrate composition 
analysis was carried out by hydrolysis of 50 /xg peptide with 2M 

HC1 for lh, 2h and 4h at 100 °C and monosaccharides were 

i 

15 separated on a CarboPac PAI (Dionex, Sunnyvale, CA) column (4 
x 250 mm) eluted with 14 mM NaOH. The monosaccharides were 
detected by pulsed amperometric detection (Dionex PAD- 
detector) . The amount of monosaccharides was corrected to zero 
time of hydrolysis and calculated as nmol of monosaccharide per 

20 nmol of peptide. 

Mass spectrometry analysis was performed using an API III 
LC/MS/MS system (Sciex, Thornhill, Ontario, Canada). The triple 
quadrupole instrument has a mass-to-charge (m/z) range of 24 00 
and is fitted with a pneumatically assisted electrospray (also 

25 referred to as ion-spray) interface [23,24]. Sample 
introduction was done by a syringe infusion pump (Sage 
Instruments, Cambridge, MA) through a fused capillary (75 /xm 
i.d.) with a liquid flow-rate set at 0.5-1 /il/min. The 
instrument m/z scale was calibrated with the singly-charged am- 

30 monium adduct ions of poly (propylene glycols) (PPG's) under unit 
resolution. 

The accuracy of mass measurements was generally better than 
0.02%. 
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RESULTS 



Expression and purification 

DNA fragments encoding the two trefoil domains of HSP were 
isolated by PGR from human genomic DNA using primer, based on 
5 the published cDNA sequence [8]. The full length HSP gene was 
obtained from the PCR cloned fragments by addition of synthetic 
DNA fragments. The HSP gene was fused in-frame to a hybrid 
yeast leader sequence by overlay extension PCR [31] (Fig 3) 
The hybrid leader is based on the mouse salivary amylase signal 
10 peptide [34] and the S i _j a uyveri a mating factor leader 
sequence [35] and is further modified near the Kex 2 cleavage 
site for efficient processing [36, 41], 

The yeast expression plasmid pKFN-1847 contains the leader-HSP 
gene inserted between the S_^visiae triose phosphate 
15 isomerase promoter and terminator [37]. The expression vector 
(Fig. 4) also contains the Sc^izosaccharnn,^ 

(POT) [38]. 9 ne 

The plasmid was transformed into the yeast strain MT-663 
carrying a deletion in the TPI gene, by selecting for growth on 
20 glucose. 



The expression level of r-HSP in the present yeast system is 
approx. 120 mg/1. As can be seen from Fig. 5, the yeast su- 
pernatant contains two forms of r-HSP; one eluting at R = 15 6 
mm. and one eluting at R t - l6ml min . These tWQ ^ ^ 
25 purified separately, and by using the analytical HPLC-system 
(Fig. 5), these two forms can be quantified individually during 
the different steps of the purification. 

After the initial concentration of the yeast supernatant by 
ultrafiltration, the first purification step was cationic 
30 exchange chromatography on a Fast Flow s column. Fig. 6 shows 
the elution profile from the column including the amount of r- 
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HSP and glycosylated r-HSP determined in the fractions, A 
complete separation of the two forms of r-HSP was obtained in 
this step. 

The fractions from the Fast Flows S column were pooled as 
5 indicated in Fig. 6, and the two peptides were further purified 
by preparative HPLC (Fig. 7). The r-HSP and glycosylated r-HSP 
were recovered from the fractions indicated in Fig. 7A and Fig. 
7B by vacuum centrif ugation and lyophilization . The 
purification is summarized in Table 1. The overall yield' of r- 
10 HSP and glycosylated r-HSP from 8 litres of fermentation broth 
was 160 mg and 219 mg corresponding to' 50% and 34%, 
respectively. 

Characterization of r-HSP and glycosylated r-HSP 

Fig. 8 shows the purity of r-HSP and glycosylated r-HSP as 
15 analysed by analytical HPLC. From these results none of the 
peptides looks completely pure. However, upon rechromatography 
of material eluting in the minor as well as the major peak, 
similar chromatograms were obtained for both peptides (results 
not shown) . This seems to indicate that the double peak 
20 observed for both r-HSP and glycosylated r-HSP reflects an 
atypical behaviour of these peptides on reverse phase columns 
' rather than impurities in the preparations. 

Table 2 shows the amino acid sequencing results obtained on r- 
HSP and glycosylated r-HSP. The average repetitive yield was 

25 94.4% (r-HSP) and 94.6% (glycosylated r-HSP), respectively. In 
both cases the first 40 residues of the two peptides were 
confirmed by the sequence analysis. In the glycosylated HSP, no 
PTH-a.a. was found in Edman degradation cycle No. 15. The HSP 
sequence from residue 15-17 ( Asn-Arg-Thr) corresponds to a 

30 consensus sequence for N-glycosylation of Asn-15. 

The carbohydrate composition analysis of glysocylated r-HSP 
showed the presence of 12.8 nmol mannose (Man) and 1.6 nmol of 
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N-acetyl glucoseamine (GlcNAc) per nmol of r-HSP. By peptide 
mapping of r-HSP and glycosylated r-HSP in combination with 
mass spectrometry and sequencing analysis (results not shown) , 
no other residue besides Asn-15 of the glycosylated r-HSP was 
5 found to be modified, i.e. no O-glycosylation was found. 

in Fig. 9, the electro-spray mass spectrometry (ESMS) analysis 
is shown for r-HSP and glycosylated r-HSP. Fig. 9 A and 9C are 
original mass spectra displaying characteristics series of 
multiply charged protonated ions always observed in ESMS 

10 spectra of proteins. Fig. 9B and 9D are the corresponding 
computer reconstructed mass spectra from which the molecular 
weight of individual components may be read directly. As can be 
seen from Fig. 9B, the MW found for r-HSP is 11961.5 ±2 which 
is in very good agreement with a calculated mass of 11961 3 

15 Fig. 9D shows the reconstructed ion spray mass spectrum of the 
glycosylated r-HSP. From the sequence analysis and the 
carbohydrate composition analysis, it is known that only Asn-15 
is glycosylated and that only two monosaccharide residues 
mannose and N-acetyl glucoseamine, occur in the glycosylated 
20 form of r-HSP. From these results in combination with the mass 
spectrometry data, it is possible to deduce the different 
glycosylated forms of r-HSP (Table 3). 

Molecular weights corresponding to two series of carbohydrate 
side chains can be deduced from the combination of carbohydrate 

25 composition data and ISMS-data, namely (GlcNAc) 2 (Hex) 10 . 15 and 
(Hex) 13 . 17 (Table 3). As mannose is the only hexose in the glyco- 
sylated r-HSP, and as Asn-15 is the only glycosylated residue 
it seems reasonable to conclude that the structure of the 
glycosylation site is Asn- (GlcNAc),- (Man) The observed Asn- 

30 <H.x) 13 . 17 forms are thus most likely to arise from fragmentation 
in the mass spectrometer, by which the two GlcNAc residues lose 
an acetyl group and are converted into two hexoses. 
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The structure of Asn- (GlcNAc) 2 - (Man) 1CM5 has previously been 
reported as high mannose type of N-glycosylation for other 
peptides and proteins expressed in yeast [26]. 

Table 1 , 

5 Purification of r-hSP and glycosylated r-hSP from yeast 
supernatant 







Amount [mg] 


Yield [%] 


STEP 


Volume 
[ml] 


r-hSP 


glycosylated 
r-hSP 


r-hSP 


glycosylated 
i-hSP 


Yeast supernatant 


8000 


320, 


640 


100 


100 


Ultrafiltration 


900 


207 


405 


65 


63 


Ion exchange Pool 1 
chromatography 

Pool 2 


160 

, 220 


182 


275 


57 


43 


Prep HPLC Pool 1 


54 




219 




34 


Pool 2 


80 


160 




50 
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Cycle No. 



1 

2 
3 
4 
5 
6 
7 
8 
9 
10 
11 
12 
13 
14 
15 
16 
17 
18 
19 
20 
21 
22 
23 
24 
25 
26 
27 
28 
29 
30 
31 
32 
33 
34 
35 
36 
37 
38 
39 
40 



PTH-a.a. 



Glu 
Lys 
Pro 
Ser 
Pro 
(Cys) 
Gin 
(Cys) 
Ser 
Arc 
Leu 
Ser 
Pro 
His 
Asn 
Arg 
Thr 
Asn 

Gly 
Phe 
Pro 
Gly 
He 
Thr 
Ser 
Asp 
Gin 
(Cys) 
Phe 
Asp 
Asn 
Gly 
(Cys) 
(Cys) 
Phe 
Asp 
Ser 
Ser 
Val 



Yield (pmol) 



r-hSP 



4304 
6925 
6027 
2890 
4336 
ND 
3388 
ND 
1279 
1876 
2277 
877 
1545 
517 
1202 
959 
978 
1066 
ND 
836 
993 
843 
785 
640 
589 
274 
581 
445 
ND 
623 
483 
369 
359 
ND 
ND 
422 
26S 
127 
145 
298 



glycosylated r- 
hSP 



8853 
8292 
12837 
5602 
8802 

ND 
5689 

ND 
2417 
2523 
4290 
1790 
2963 

574 

0* 
1471 
2172 
1509 

ND 
1857 
1958 
1839 
2049 
1400 
1454 
621 
1391 
952 

ND 
1562 
1210 
823 
885 
ND 
ND 
1094 
783 
324 
394 
827 



ND; Not determined 

*: No trace of PTH-Asn or PTH-Asp was seen in cycle No. 15 of glycosylated 
r-hSP. 
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Structure Calculated MW found by 

MW ESMS (Fig.9D) 

hSP + 2 GlcNAc + 10 Man 13989.1 '■ 13989.5 

hSP + 2 GlcNAc + 11 Man 14151.2 14151.0 

hSP + 2 GlcNAc + 12 Man 14313.4 14313.5 

hSP '+ 2 GlcNAc + 13 Man 14475.5 14475.0 

hSP + 2 GlcNAc + 14 Man 14639.7 14639.5 , 

hSP + 2 GlcNAc + 15 Man 14799.8 14801.5 

hSP + 13 Man 14069.1 14072.0 

hSP + 14Man 14231.3 14232.5 

hSP + 15 Man 14393.4 14393.0 

hSP+l6Man 14555.5 14557.5 

hSP + 17 Man 14717.7 14720.0 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: Novo Nordisk h/S 

(B) STREET: Novo Alle 

(C) CITY: Bagsvaerd 

(E) COUNTRY: Denmark 

(F) POSTAL CODE (ZIP) : 2880 

(G) TELEPHONE: +45 4444 8888 

(H) TELEFAX: +45 4449 3256 

(I) TELEX: 37304 

(ii) TI^OF INVENTION: Hurran Spasmolytic Polypeptide in Glycosylated 

(iii) NUMBER OF SEQUENCES: 14 

(iv) O0MFUIER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC qanpatiMe 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTV2ARE: Ffctentln Release $1.0, Version #1.25 (EPO) 

(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 106 amino acids 

(B) TYPE: ainino acid 

(C) STRANDEDMESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

Glu Lys Pro Ser Pro Cys Gin Cys Ser Arg Leu Ser Pro His Asn Ara 
1 5 10 15 

Thr Asn Cys Gly Pfce Pro Gly He Ihr Ser Asp Gin Cys Hie Asp Asn 



30 
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Gly Cys Cys Phe Asp Ser Ser Val Thr Gly Val Pro Trp Cys Phe His 
35 40 45 

Pro Leu Pro Lys Gin Glu Ser Asp Gin Cys Val Met Glu Val Ser Asp 
50 55 60 

Arg Arg Ash Cys Gly Tyr Pro Gly He Ser Pro Glu Glu Cys Ala Ser 
65 ~ ** 70 75 80 

Arg Lys Cys Cys Phe Ser Asn Phe' He Phe Glu Val Pro Trp Cys Phe 
85 90 95 

Phe Pro Asn Ser Val Glu Asp Cys His Tyr 
100 105 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 563 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

' (ii) MOLECULE TYPE: DMA (genomic) 

(iii) HYPOTHETTCAL: NO 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 77.. 553 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide . 

(B) LOCATION: 77.. 235 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 236.. 553 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

GAATTCCATT CAAGAATAGT TCAAACAAGA AGATTACAAA CTATCAATTT CATACACAAT 60 

ATAAAOGAOC AAAAGA ATG AAG GCT GTT TTC TIG GTT TTG TCC TTG ATC 109 
Met Lys Ala Val Phe Leu Val Leu Ser Leu He 
-53 " -50 -45 

GGA TTC TGC TGG GCC CAA CCA GTC ACT GGC GAT GAA TCA TCT GTT GAG 157 
Gly Phe Cys Trp Ala Gin Pro Val Thr Gly Asp Glu Ser Ser Val Glu 
-40 -35 -30 
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(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 159 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Met Lys Ala Val Phe Leu Val Leu Ser Leu He Gly Phe Cys Trp Ala 
~ 53 -50 -45 _ 4Q 

Gin Pro Val Thr Gly Asp Glu Ser Ser Val Glu He Pro Glu Glu Ser 
-35 -30 -25 

Leu He He Ala Glu Asn Thr Thr Leu Ala Asn Val Ala Met Ala Glu 
-20 -15 _ 10 



205 



301 



ATT CCG GAA GAG TCT CTG ATC ATC GCT GAA AAC ACC ACT TIG GCT AAC 
He Pro Glu Glu Ser Leu He He Ala Glu Asn Thr Thr Leu Ala Asn 
-25 -20 -is 

GTC GCC ATG GCT GAG AGA TTG GAG AAG AGA GAG AAA CCC TCC CCC TGC 253 
Val Ala Met Ala Glu Arg Leu Glu Lys Arg Glu Lys Pro Ser Pro Cys 
-10 -5 1 5 

CAG TGC TCC AGG CTG AGC CCC CAT AAC AGG ACG AAC TGC GGC TTC CCT 
Gin Cys Ser Arg Leu Ser Pro His Asn Arg Thr Asn Cys Gly Phe Pro 
10 15 * 20 

GGA ATC ACC AGT GAC CAG TCP TTT GAC AAT GGA TGC TCT TTC GAC TCC 349 
Gly He Thr Ser Asp Gin Cys Phe Asp Asn Gly Cys Cys Phe Asp Ser 
25 30 35 

AGT GTC ACT GGG GTC CCC TGG TCT TTC CAC CCC CTC CCA AAG CAA GAG 397 
Ser Val Thr Gly Val Pro Trp Cys Phe His Pro Leu Pro Lys Gin Glu 
40 45 50 

TOG GAT CAG TGC GTC ATG GAG GTC TCA GAC AGA AGA AAC TGT GGC TAC 
Ser Asp Gin Cys Val Met Glu Val Ser Asp Arg Arg Asn Cys Gly Tyr 
55 60 65 7 0 

COG GGC ATC AGC CCC GAG GAA TGC GCC TCT CGG AAG TGC TGC TTC TCC 
Pro Gly He Ser Pro Glu Glu Cys Ala Ser Arg Lys Cys Cys Phe Ser 
75 80 85 

^ C P° ™° 61X3 ^ 105 1X30 ^ ^ CCG AAC TCT GTC GAA 541 

Asn Phe He Phe Glu Val Pro Trp Cys Phe Phe Pro Asn Ser Val Glu 
90 95 100 

GAC TCC CAT TAC TAAGTCTAGA 

Asp Cys His Tyr 3 
105 



445 
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Arg Ibu Glu Lys Arg Glu Lys Pro Ser Pro Cys Gin Cys Ser Arg Leu 
-5 1 . 5 10 

Ser Pro His Asn Arg Thr Asn Cys Gly Phe Pro Gly lie Thr Ser Asp 
15 20 25 

Gin Cys Phe Asp Asn Gly Cys Cys Phe Asp Ser Ser Val Thr Gly Val 
30 35 40 

Pro Trp Cys Phe His Pro Leu Pro Lys Gin Glu Ser Asp Gin Cys Val 
45 50 55 

Met Glu Val Ser Asp Arg Arg Asn Cys Gly Tyr Pro Gly He Ser Pro 
60 65 70 75 

Glu Glu Cys Ala Ser Arg Lys Cys Cys Phe Ser Asn Phe He Phe Glu 
80 85 ( 90 

Val Pro Trp Cys Phe Phe Pro Asn Ser Val Glu Asp Cys His Tyr 
95 100 105 

(2) BJP0KMATION FOR SEQ ID NO: 4: 

■ (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

. (A) ORGANISM: synthetic 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
GGCTGAGCCC CCATAACAG 19 
(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEENESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: synthetic 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
TCGAAACACC AGGGGAC 
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOIDGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: synthetic 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
GAGAAACCCT CCCCCTGCCA GTGCTCCAGG C 
(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: synthetic 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
TCAGCCIGGA GCACTCGCAG GGGGAGGGTT TCTC 
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
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(vi). ORIGINAL SOURCE: 

1 (A) ORGANISM: synthetic 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
GCIGAGAGAT TCGAGAAGAG AGAGAAACCC TCCCCCT 37 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: synthetic 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
TGCCTCATGG AGGTCTC 17 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: synthetic 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
AGCACCATGG CACTTCAAAG 20 
(2) INFORMATION FOR SEQ ID NO: 11: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: synthetic 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
GTCTCCTGCT GITTCCACCC CCTCCCAAAG CAAGAGTCGG ATCAGTCOGT CATGGAGGTC 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: synthetic 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
TCAGACCTCC ATCAOGCACT GATCCGACTC TTCCTTTCGG AGGGGGIGGA AACACCAGGG 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: synthetic 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

CATGGTCCTT CITCCOGAAC TCTGTGGAAG ACTCCCATTA CTAAGT 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 46 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: .synthetic 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
CTAGACTTAG TAATCGCAGT CTTCCACAGA GTTCGGGAAG AAGCAC 46 
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CLAIMS 

1. Human spasmolytic polypeptide (HSP) which has the amino acid 
sequence 

Glu Lys Pro Ser Pro Cys Gin Cys Ser Arg Leu Ser Pro His Asn Arg 
5 Thr Asn Cys Gly Phe Pro Gly lie Thr Ser Asp Gin Cys Phe Asp Asn 
Gly Cys Cys Phe Asp Ser Ser Val Thr Gly Val Pro Trp Cys Phe His 
Pro Leu Pro Lys Gin Glu Ser Asp Gin Cys Val Met Glu Val Ser Asp 
Arg Arg Asn Cys Gly Tyr Pro Gly lie Ser Pro Glu Glu Cys Ala Ser 
Arg Lys Cys Cys Phe Ser Asn Phe He Phe Glu Val Pro Trp Cys Phe 
10 Phe Pro Asn Ser Val Glu Asp Cys His Tyr (SEQ ID NO:l) 

or a functionally equivalent homologue thereof, characterized 
by being in glycosylated form. 

i 

2. HSP according to claim 1, characterized by being in N- 
glycosylated form. 

15 3. HSP according to claim 1 or 2 , which is glycosylated at 
Asnl5 . 

4. HSP according to claim 3, in which the glycosylated side 
chain contains at least one hexose unit. 

5. HSP according to claim 4, in which the glycosylated side 
20 chain contains at least one mannose unit, preferably at least 

five mannose units, most preferably at least ten mannose units. 

6. HSP according to claim 5, in which the glycosylated side 
chain contains 13-17 mannose units. 

7. HSP according to claim 4, which in addition is glycosylated 
25 with at least one unit of N-acetyl glucosamine (GlcNAc) . 
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8. HSP, according to claim 7, which is glycosylated with 
■ (GlcNAc) 2 (Man) 10 . 15 - 

9. HSP according to claim 1, wherein Lys2 is replaced by Asn, 
Gln7 is replaced by Asn, ArglO is replaced by Asn, Gly 20 is 

5 replaced by Thr or Ser, Gly23 is replaced by Asn, lie 24 is 
replaced by Asn, Phe 36 is replaced by Asn, Asp 37 is replaced 
by Asn, Ser39 is replaced by Asn, Gln53 is replaced by Asn, 
Glu61 ia replaced by Asn, Asp64 is replaced by Asn, Arg66 is 
replaced by Thr or Ser, Gly69 is replaced by Thr or Ser, Gly72 
10 is replaced by Asn, lie 89 is replaced by Thr or Ser, Pro98 is 
replaced by Asn or VallOl is replaced by Thr or Ser., or a 
combination of two or more of these substitutions. 

10. HSP according to claim 9, wherein Asp64 is replaced by Asn, 
and wherein Arg66 is replaced by Thr or Ser. 



15 11. HSP according to claim 9, in which the glycosylated Asn 
contains at least one hexose unit. 

12. HSP according to claim 11, which is glycosylated with at 
least one mannose unit, preferably at least five mannose units, 
most preferably at least ten mannose units. 

20 13. HSP according to claim 12, which is glycosylated with 13-17 
mannose units. 

14. HSP according to claim 11, which in addition is 
glycosylated with at least one unit of N-acetyl glucosamine 
(GlcNAc) . 

25 15. HSP according to claim 14, which is glycosylated with 
(GlcNAc) 2 (Man) 10 . 15 . 

16. A variant of a spasmolytic polypeptide which is a fragment 
of human spasmolytic polypeptide (HSP) or porcine spasmolytic 
polypeptide (PSP) comprising at least one trefoil domain. 
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17. A variant according to claim 16, which comprises at least 
a sequence of amino acids from position 8 to 46 or from 
position 58 to 95. 

18. A variant according to claim 16 or 17, wherein Asnl5 is 
5 substituted by another amino acid, e.g. Asp or Glu. ' 

19. A variant according to claim 16 or 17, wherein Thrl7 is 
substituted by another amino acid except Ser, e.g. Ala. 

20. A variant according to claim 16 or 17, wherein ArglO is 
replaced by Asn, Gly 20 is replaced by Thr or Ser, Gly23 is 

10 replaced by Asn, He 24 is replaced by Asn, Phe 36 is replaced 
by Asn, Asp 37 is replaced by Asn, Ser39 is replaced by Asn, 
Glu61 ia replaced by Asn, Asp64 is replaced by Asn, Arg66 is 
replaced by Thr or Ser, Gly69 is replaced by Thr or Ser, or 
Gly72 is replaced by Asn, or a combination of two or more of 

15 these substitutions. 

21. A method of preparing a spasmolytic polypeptide in at least 
60% glycosylated form, wherein a host cell transformed with a 
DNA fragment encoding a spasmolytic polypeptide and capable of 
providing glycosylation of said spasmolytic polypeptide is 

20 cultured under conditions permitting production of said 
spasmolytic polypeptide and recovering the resulting 
spasmolytic polypeptide from the culture. 

22. A method according to claim 21, wherein the host cell is a 
fungal cell such as a yeast cell, e.g. a strain of 

25 Saccharomyces cerevisiae. or a filamentous fungus cell, e.g. a 
strain of Aspergillus , such as Aspergillus ory zae. 

23. A pharmaceutical composition comprising HSP according to 
any of claims 1-15 together with a pharmaceutical^ acceptable 
carrier or excipient. 
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24. A pharmaceutical composition comprising a variant 
spasmolytic polypeptide according to any of claims 16-20 
together with a pharmaceutical^, acceptable carrier or 
excipient . 

5 25. Use of HSP according to any of claims 1-15 for the 
manufacture of a medicament for the prophylaxis or treatment of 
gastrointestinal disorders. 

26. Use of a variant spasmolytic polypeptide according to any 
of claims 16-20 for the manufacture of a medicament for the 
10 prophylaxis or treatment of gastrointestinal disorders. 
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EcoRI 

GAATTCCATTCAAGAATAGTTCAAACAAGAAGATTACAAACTATCAATTTCATACACAAT 



ATAAACGACCAAAAGAATGAAGGCTGTTTTCTTGGTTTTGTCCTTGATCGGATTCTGCTG 
MKAVFLVLSLIGFCW 

GGCCCAACCAGTCACTGGCGATGAATCATCTGTTGAGATTCCGGAAGAGTCTCTGATCAT 
A-QPVTGDESSV.EIP. EESLII 

CGCTGAAAACACCACTTTGGCTAACGTCGCCATGGCTGAGAGATTGGAGAAGAGAgagaa 
AENTTLANVAMAERLEKRjEK 

Dde I 1 , T 

accctccccctgccaatactcca oacTGAGCCCCCATAACAG GACGAACTGCGGCTTCCC 
PSPCQCSRLSPHNRTNCGFP 

TGGAATCACCAGTGACCAGTGTTTTGACAATGGATGCTGTTTCGACTCfcAGTGTCACTGG 
G ITSDQCFDNGCCFD. SSVTG 

Avail < 2 4 > 

G crfccccctQqtcrbttcca ccccctccGaaaqcaaqacrtcqqatcag tqcqtcatqqaacrt 
VPWCFHPLPKQESDQCVMEV 

Dde I 

cTCAGACAGAAGAAACTGTGGCTACCCGqGCATCAGCCCCGAGGAATGCGCCTCTCGGAA 
S DRRNCGYPG-I SPEECASRK 

«. 5 Nco I 
GTGCTGCTTCTCCAACTTCAT CTT^^ 

CCFSNFIFEVPWCFFPNSVE 
Xba T 

agactgccattactaagtCTAGA 
D C H Y 



Fig. 3 



BNSDOCID: <WO 9417102A1_I_> 



PCT/DK94/00037 




WO 94/17102 



PCT/DK94/00037 



5/9 




Time (min) 



Fig. 5 
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Fraction no. (voI.= 10 ml) 
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Fig. 7 
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